A large quantity of novel human antisense transcripts detected by LongSAGE
نویسندگان
چکیده
MOTIVATION Taking advantage of the high sensitivity and specificity of LongSAGE tag for transcript detection and genome mapping, we analyzed the 632 813 unique human LongSAGE tags deposited in public databases to identify novel human antisense transcripts. RESULTS Our study identified 45 321 tags that match the antisense strand of 9804 known mRNA sequences, 6606 of which contain antisense ESTs and 3198 are mapped only by SAGE tags. Quantitative analysis showed that the detected antisense transcripts are present at levels lower than their counterpart sense transcripts. Experimental results confirmed the presence of antisense transcripts detected by the antisense tags. We also constructed an antisense tag database that can be used to identify the antisense SAGE tags originated from the antisense strand of known mRNA sequences included in the RefSeq database. CONCLUSIONS Our study highlights the benefits of exploring SAGE data for comprehensive identification of human antisense transcripts and demonstrates the prevalence of antisense transcripts in the human genome.
منابع مشابه
LongSAGE analysis revealed the presence of a large number of novel antisense genes in the mouse genome
MOTIVATION Despite the increasing notions of the functional importance of antisense transcripts in gene regulation, the genome-wide overview on the ontology of antisense genes has not been obtained. Therefore, we tried to find novel antisense genes genome-wide by using our LongSAGE dataset of 202 015 tags (consisting of 41 718 unique tags), experimentally generated from mouse embryonic tail lib...
متن کاملNext-generation tag sequencing for cancer gene expression profiling.
We describe a new method, Tag-seq, which employs ultra high-throughput sequencing of 21 base pair cDNA tags for sensitive and cost-effective gene expression profiling. We compared Tag-seq data to LongSAGE data and observed improved representation of several classes of rare transcripts, including transcription factors, antisense transcripts, and intronic sequences, the latter possibly representi...
متن کاملLongSAGE analysis significantly improves genome annotation: identifications of novel genes and alternative transcripts in the mouse
MOTIVATION Owing to its increased tag length, LongSAGE tags are expected to be more reliable in direct assignment to genome sequences. Therefore, we evaluated the use of LongSAGE data in genome annotation by using our LongSAGE dataset of 202 015 tags (consisting of 41 718 unique tags), experimentally generated from mouse embryonic tail libraries. RESULTS A fraction of LongSAGE tags could not ...
متن کاملLarge-scale identification of novel transcripts in the human genome.
Although the sequencing of the human genome has been completed, the number and identity of genes contained within it remains to be fully determined. We used LongSAGE to analyze 660,357 human transcripts from human brain mRNA and identified expression of 17,409 known genes and >15,000 different transcripts that were not annotated in genome databases. Analysis of a subset of these unannotated tra...
متن کاملaRNA-longSAGE: a new approach to generate SAGE libraries from microdissected cells.
Large-scale gene expression analyses of microdissected primary tissue are still difficult because generally only a limited amount of mRNA can be obtained from microdissected cells. The introduction of the T7-based RNA amplification technique was an important step to reduce the amount of RNA needed for such analyses. This amplification technique produces amplified antisense RNA (aRNA), which so ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Bioinformatics
دوره 22 20 شماره
صفحات -
تاریخ انتشار 2006